FASTUS: Extracting Information from Natural- Language Texts

نویسندگان

Jerry R. Hobbs

Douglas Appelt

John Bear

David Israel

Megumi Kameyama

Mark Stickel

چکیده

FASTUS is a system for extracting information from natural language text for entry into a database and for other applications. It works essentially as a cascaded, nondeterministic finite-state automaton. There are five stages in the operation of FASTUS. In Stage 1, names and other fixed form expressions are recognized. In Stage 2, basic noun groups, verb groups, and prepositions and some other particles are recognized. In Stage 3, certain complex noun groups and verb groups are constructed. Patterns for events of interest are identified in Stage 4 and corresponding ``event structures'' are built. In Stage 5, distinct event structures that describe the same event are identified and merged, and these are used in generating database entries. This decomposition of language processing enables the system to do exactly the right amount of domain-independent syntax, so that domain-dependent semantic and pragmatic processing can be applied to the right larger-scale structures. FASTUS is very efficient and effective, and has been used successfully in a number of applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text

متن کامل

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

FASTUS: A System for Extracting Information from Natural-Language Text

FASTUS is a system for extracting information from free text in English, and potentially other languages as well, for entry into a database, and potentially for other applications. It works essentially as a cascaded, nondeterministic finite state automaton. There are four steps in the operation of FASTUS. In Step 1 sentences are scanned for certain trigger words to determine whether further pro...

متن کامل

SRI : Description of the JV - FASTUS System Used for MUC - 5 Douglas

INTRODUCTION AND BACKGROUND SRI International developed an information extraction system called FASTUS 1 , a permuted acronym standing for \Finite State Automata-based Text Understanding System. The choice of acronym is somewhat misleading, however, because FASTUS is a system for information extraction, not text understanding. The former problem is much simpler and more tractable, characterized...

متن کامل

Natural language processing methods for extracting information from mathematical texts

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1996

FASTUS: Extracting Information from Natural- Language Texts

نویسندگان

چکیده

منابع مشابه

FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

FASTUS: A System for Extracting Information from Natural-Language Text

SRI : Description of the JV - FASTUS System Used for MUC - 5 Douglas

Natural language processing methods for extracting information from mathematical texts

عنوان ژورنال:

اشتراک گذاری